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ABSTRACT 

We examine systematic differences in tfie derived X-ray prope rties of galax y clusters as reported 
by thre e different groups: IVikhlinin eFall (|2009a[ ). iMantz et ail ()2010bl ). and jPlanck Collaboration! 
(|2011bl ). The sample overlap between any two pairs of works ranges between 16 to 28 galaxy clusters 
in common. We find systematic differences in most reported properties, including the total cluster 
mass, Msnn- The most extrem e ca se is an average 45% ± 5% difference in cluster mass between the 
IPlanck Collaboration! (j2011b[) and IMantz et al.l (j2010b[ ). for clusters at z > 0.13 (averaged over 16 
clusters). These mass differences induce differences in cluster observables defined within an i?5oo 
aperture. After accounting for aperture differences, wc find very good agreement in gas mass es- 
timates between the different groups. However, the soft-band X-ray luminosity, Lx, core-excised 
spectroscopic temperature, Tx, and gas thermal energy, Yx = MgasTx display mean differences at the 
5%-15% level. We al s o find that the low {z < 0.13) and high (z > 0.13) galaxy cluster samples in 
IPlanck Collaboration! (j2011bf ) appear to be systematically different: the Ygz/Yx ratio for these two 
sub-samples is \n(Ysz/Yx) = -0.06 ±0.04 and \n{Ysz/Yx) = 0.08 ±0.04 for the low and high redshift 
sub-samples respectively. 
Subject headings: cosmology: clusters 



1. INTRODUCTION 

X-rays studies of galaxy clusters are an established 
method for investigating the physics of the intra- 
cluster medium through cl uster scaling r elations (e.g., 
Pratt ct al. 2009; Vikhhni n et al.l l2009bl : lArnaud et ahl 
2010HMantz et al.i2010bD , as well as fo r constraining cos- 
mology with cluster abundances (e.g . . iHenrv et alll2009l : 
IVikhlinin et"alll2009at IMantz et al.l l2010aD . With clus- 
ter samples of increasi ng size and improving qu ality ap- 
proaching —e^gJ^CCSjUoid^D^i^^ XXlE, 
and eRosita (|Pillepich et al.ll2011[ ) — accuratelv quanti- 
fying the level of systematic uncertainties in data analysis 
is becoming a significant issue. Instrumental uncertainty 
has long been recognized as a source of X-ray modeling 
error, and the relative cross-calibration of the Chandra 
and XMM-Newton observatories has bee n explor e d in a 
vari ety of works, most rece ntlv iNcvalainc n et al.l ()2010l ) 
■ and iTsuiimoto et al.l ()2011[ ). These studies find w 10% 
systematic differen ces in the effect ive area of the two in- 
[ struments (see also lSnowdenll200^ . as well as differences 
of « 10% — 15% in temperature estimates. 
Differences in instrumental calibration can be fur- 
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ther compounded by methodological differences in the 
data analysis between different groups. For instance, 
it has been suggested that the choice of parameteriza- 
tion of the intra-cluster medium plays a n important role 
in th e derived cluster scaling reiations (jMantz fc AllenI 
1201 Ih . Theoretically, these instrumental and method- 
ological sources of error should be independent. In prac- 
tice, however, the degree of independence is compromised 
because any given research group often relies heavily on 
a single method applied to a specific instrument. Conse- 
quently, disentangling these two sources of possible sys- 
tematic errors is difficult. 

Relative to instrumental calibration, methodological 
sources of error have received comparatively little at- 
tention. Codes to compute plasma emission, principally 
SPEX/MEKAL and APEC, produce similar continuum, 
but differ in expectations for line emission even at rel- 
atively low spectral resolutior0. In addition, deriving 
X-ray observables from raw photon counts requires that 
the X-ray background and hydrogen column density to 
the cluster be estimated, points sources be masked, intra- 
cluster gas metallicities fit for or assumed, and data cuts 
applied (for instance, with respect to the energy range 
used to fit for the spectral X-ray temperature). All of 
these procedures can lead to systematic differences of 
varying degree. 

In this work, we take a pragmatic approach to es- 
timating the level of systematic differences in derived 
X-ray cluster properties. With the maturation of the 
XMM-Newton and Chandra observatories, the likeli- 
hood of a nearby cluster having been sensitively im- 
aged by both observatories has increased to the point 
where several dozen such systems now exist. We exam- 
ine the differences in pairs of such measurements per- 
formed by different groups, focusing on three samples 
used for recent cosmological studies: the Vikhlinin sam- 

^ http:/ /www. atomdb.org/issues.php 
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pie, composed of the syste ms in iVikhlinin et all (|2OO60 
and lVikhlinin et alj ()2009aD . henceforth referred to as the 
V0 9 sample: th e Mant z sample, composed of systems 
in iMantz et al.l ()2010bf ). henceforth referred to as the 
MIO sampl e: and the Pl anck XMM sample, composed of 
systems in iPlanck Collab oration (■2011bf ). henceforth re- 
ferred to as the Pll-LS sample. The "-LS" signifies that 
this is the local scaling relations paper from the Planck 
Early Results series of papers. We will a l so br iefly con- 
sider clusters in the iPointecouteau et all (|2005[ ) sample. 
Published properties include the soft-band X-ray lumi- 
nosity, Lx, intracluster gas mass, Mgas, gas temperature, 
Tx, gas thermal energy, Ix, and the hydrostatic mass 
M500, defined below. We summarize the observables un- 
der consideration in Table [T] 

Our comparisons arc based on a total of 16 clusters 
shared between the Vikhlinin and Mantz samples, 23 
common clusters between the Vikhlinin and Planck sam- 
ples, and 28 common clusters between the Planck and 
Mantz samples. We note that of these last 28 sys- 
tems, only 12 of th em have temperatures estimated by 
IMantz et all (j2010bD . Consequently, the Tx and Yx com- 
parison for this pair of samples is limited to these sys- 
tems only. We have restricted ourselves to low-redshift 
{z < 0.3) systems in this work. We also note that there 
are 6 clusters in common between IPointecouteau et al.l 
(|2005f ) and the Vikhlinin samples, some of which appear 
in the IPlanck Collaborat ion (20liB) sample as well. 

To extent that we can, we take care to ensure that all 
cluster observables are consistently defined and measured 
within the same apertures, and rescale all measurements 
to the same fiducial flat ACDM cosmology with ilm = 0.3 
and h = 0.7. This last rescaling only affects the Vikhlinin 
measurements, as they adopted h ~ 0.72 as their fiducial 
Hubble parameter. Our corrections follow the standard 
Lx oc h-"^, Mgas oc ft.~5/2^ Yx oc and M cx /i^^ 

scalings. 

This is the first in a pair of papers that characterize the 
overall systematic differences in X-ray observables be- 
tween three independent X-ray analyses, and the impact 
that these differences have o n cluster scaling r elations. 
The second paper in the series (jRozo et al.ll2012ct ) utilizes 
the results from this work to demonstrate that the differ- 
ences in the scaling relations from X-ray cluster samples 
are driven primarily by the systematic differences in the 
X-ray cluster observables: all additional sources of com- 
plication, including fitting methods and modeling of se- 
lection function effects, are sub-dominant relative to the 
overall systematic difference s in the X-ray otj servables 
themselves. A third paper (jRozo et al.ll2012"bl ) extends 
our an alysis to the o p tically selected maxBCG cluster 
catalog iKoester et al.l (120071). focusing in pa,rticula r on 
the tension noted in IPlanck CollaborationI ()2011d ) be- 
tween the observed and expected SZ signal of maxBCG 
clusters. 

The layout of the paper is as follows: Section [2] briefly 
summarizes each data set used in our comparison. Sec- 
tion [3] presents the results of our investigation, including 
some general discussion in section 13.61 Section 4 dis- 
cusses the surprisingly large amount of evolution in the 
mass offset in the (Pll-LS)-MlO mass comparison. A 
summary of the highlights of this work is presented in 
section [SJ 



Unless otherwise noted, all masses and associated ra- 
dial and angular scales are defined within a radius i?5oo 
that encompasses a mean interior density of 500 times 
the critical density of the universe at the cluster red- 
shift, = 3i/2(z)/87rG. A flat ACDM cosmology 
with present-epoch matter density Qm = 0.3 is assumed 
throughout. 

1.1. A Note on Statistical Errors 

The core of our analysis consists of comparing a derived 
cluster property X (e.g. Lx, Tx, etc) as estimated in 
two different works A and B. Given two cluster samples 
A and B, for all clusters in both of these samples we 
compute the difference in natural logarithm, A In X = 
ln{XA/ Xb)- Averaging over all pairs in common to A 
and B results in a mean difference (A In A')ab = r ± Ar, 
where r is the mean offset and Ar the Icr error in the 
mean. Errors are computed via bootstrap resampling, 
ensuring that the quoted uncertainties reflects the total 
variance in the data. 

In principle, our analysis should employ the statis- 
tical errors of the cluster observables quoted for each 
sample. However, we argue that the published uncer- 
tainties are unable to account for the variance seen in 
sample pairs. For example, using the published er- 
rors, the null hy pothesis that the mass of a ga laxy clus- 
ter estimated bv IPlanck CollaborationI (120111)1) is cq nsis- 
tent with that estimated bv IVikhlinin et al. F BoOQal) has 
X^/dof = 307/23. Even after correcting all masses by 
the mean systematic difference between these two data 
sets, we flnd x^/dof = 75/22. This demonstrates that 
the quoted uncertainties do not account for the full level 
of variation between the samples, and explains our re- 
liance on bootstrap resampling for estimating uncertain- 
tiesH 

2. CLUSTER SAMPLES 

The galaxy cluster data used in this work consists of 
low rcdshift {z < 0.3) clusters drawn from the three sam- 
ples listed in Table [21 We describe each sample in turn. 
Tables containing the galaxy clusters shared by each pair 
of cluster samples arc detailed in Appendix \X\ 

V09 . The low redshift clusters in the Vikhlinin sample 
were selected from a variety of sources drawn from the 
ROSAT All -Sky Survey(RASS) — the Brightest Clus- 
ter Sample (lEbeling et al.l[200a BCS), t he ROSAT-ESO 
Flux Limited X-ray sample (REFLEX iBohringer et al.l 
|2004( ). and the H ighest X-ray Flux Galaxy C luster Sam- 
ple fHIFLUCCS IReiorich fc BohringeH[200l . After ap- 
plying foreground cuts, the total area probed was 8. 14 sr. 
X-ray fluxes were remeasured using pointed ROSAT 
PSPC data when available, and a flux cut of fx > 
1.3 X lO^^crgs s~^ cm~^ was applied. The low redshift 
sample contains 49 clusters with median estimated mass 
of « 4 X 10^"* AIq that reside primarily at z < 0.1, with a 
few high mass systems out to z « 0.2. All systems have 
been imaged by Chandra, and these data are used for 

More generally, if e is the typical fractional error in Xi, then 
the expected error on the mean A In Xi estimated from galaxy 
clusters is y^2/Ne. The median statistical errors for the clusters 
under consideration have e 1% — 3%, so the expected error in 
the mean is Ar < 0.01 (assuming A'^ = 20), roughly a factor of two 
smaller than our direct estimates from bootstrap resampling. 
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TABLE 1 

Cluster Observables 



Observable Definition 



Lx soft, [0.1,2.4] keV, X-ray luminosity within a cylindrical aperture of radius, -R500. 

Mgas Gas mass within a sphere of radius, -R500. 

Tx Core excised spectroscopic X-ray temperature with cylindrical annulus, _R 6 [0.15, 1]^500-^ 

Yx Gas thermal energy as derived from the product, A^fgaslx. 

M500 Total mass, calibrated using hydrostatic mass estimates. 



^Note that different works use slightly different energy ranges when fitting cluster spectra, a potential 
source of variation in Tx that we do not account for. 



TABLE 2 
Data Sample Characteristics 



Name 


Afci 


X-ray Instrument(s) 


Publication(s) 


Sample Notes 


MIO 

Pll-LS 

V09 


238 

62 

85 


Chandra ACIS, ROSAT PSPC 

XMM-Newton Epic 

Chandra ACIS, ROSAT PSPC 


Mantz et a . f2010a) 

Planck Col aboration f2011b~) 

Vikhlinin et al. (2006. 2009^1 


Joint cosmology and scaling relation analysis 
Scaling relation analysis; joint SZ-|- X-ray selection 
Separate scaling relation and cosmology analyses 



estimating cluster temperatures. For systems where the 
Chandra field of view is smaller than the cluster radius, 
ROSAT da.ta. is used for estimating both Lx and Mgas- 
Otherwise, all data comes from Chandra. The cluster 
masses we use in our comparison are those derived using 
the M^QQ-Yx relation. This relation is calibrated using 
relaxed galaxy clusters, and relies on hydrostatic mass 
estimates derive d from the clusters' X -ray gas and tem- 
perature profile (jVikhlinin et al.l[2006[ ). 

MIO . The Mantz clusters sample is likewise drawn 
from three wide-area catalogs from the ROSAT AW-Sky 
Survey: BCS, REFLEX, and the bright sub-sample of 
the M assive Cluster Survey (Bright MACS lEbeiing et al.l 
|2010[) . Each sample covers a different region of the uni- 
verse: BCS contains z < 0.3 clusters in the Northern 
sky, REFLEX contains z < 0.3 systems in the south- 
ern sky, and Bright MACS covers 0.3 < z < 0.5 at 
declinations > —40°. Clusters are selected by apply- 
ing a redshift-dependent flux cut that approximates a 
constant luminosity cut corresponding to a mass cut 
M > 4.7 X lO^'* Mq. The number of galaxy clusters 
drawn fro m each of th e se sam ples is 78, 126, and 34. 
As in V09. lMantz et al.l (|2010b( ) rely on Chandra and/or 
ROSATPSPC point ed data for flux and t emperature es- 
timates. In all cases. iMantz et al.l ()2010b[ ) quote ROSAT 
calibrated X-ray luminosities and Mgas values. For the 
MIO sample, cluster total masses are esti mated from 
Mgas b ased on the M-Mgas scaling relation of lAUen et al.l 
(|2008[ ). This scaling relation was calibrated using hydro- 
static masses for relaxed, massive galaxy clusters, where 
mass was estimated within the aperture i?2500 enclosing 
an overdensity A = 2500 relative to critical. The scal- 
ing relation M25oo~Mgas was then extended to the outer 
radius i?5oo bas ed on nu merical simulatio ns. 

Pll-LS . The lElanckCoUaboration (20lTSh sample is 
a subset of the Planck Early Sunyaev-Zel'd ovich clus- 
ter sample fESZ lPlanck CollaboratiOT3l2011aD . The ESZ 
sample is selected from the Planck all-sky survey using 
a mu lti-frequency, matched filter method (jMelin et al.l 
12006( 1 that identifies clusters by their thermal Sunyaev- 
Zel'dovich (SZ) imprint on the cosmic microwave back- 
ground. Clusters are selected by applying a signal-to- 
noise cut S/N > 6, and it is also required that the 



clusters be identified by at least one additional indepen- 
dent SZ-cluster finding algorithm. The resulting galaxy 
clusters are cross-correlat ed against the Meta -Catalog 
of X-ray Clusters (MCXC IPiffaretti et al.l[20TTI ). result- 
ing in 158 cluster matches. These clusters are searched 
for in the XMM-Newton science archive, resulting in 88 
matches. After removing clusters contaminated by fiares, 
or excessively asymmetric galaxy clusters, the final clus- 
ter sample comprises 62 galaxy clusters. The galaxy clus- 
ters cover the redshift range z < 0.35, with an additional 
4 galaxy clusters in the redshift range z g [0.35,0.5]. 
The sel ection function described in lPlanck Collaboration! 
()2011al) has a median mass that increases with redshift. 
All X-ray quantities — Lx, Mgas, ^x, and lx — are 
estimated from the archival XMM-Newton data. To- 
tal masses are estimated from lx by r e lying on the 
M500-IX scaling relation of lArnaud et al.l (j2010t) . which 
is calibrated using hydrostatic mass estimates of relaxed 
galaxy clusters. 

Neither MIO nor Pll-LS classify the systems in their 
samples as relaxed/non-relaxed. However, Pll-LS does 
label cool-core clusters, which is generally considered a 
good proxy for relaxed systems. For comparisons involv- 
ing the V09 sample, we employ the relaxed/unrelaxed 
characterization of that work to investigate trends with 
dynamical state. For the MlO-Pll-LS comparison, we 
employ the cool core characterization of Pll-LS. 



3. COMPARISON OF DERIVED CLUSTER PROPERTIES 

In this section, we compare properties for the clusters 
found in common among pairs of the samples introduced 
above. Since all observables are reported within an aper- 
ture i?5oo that is estimated by each individual group, it 
is imperative that we first estimate any systematic dif- 
ferences in mass calibration, so that we may properly 
correct all quantities to a common aperture before we 
perform the pairwise comparison. After characterizing 
differences in mass calibration, we turn to the remaining 
cluster observables: Lx, Mgas, Tx, and lx. 

Readers wishing to avoid this level of detail can exam- 
ine Table [3] and skip to section 13. 6[ where we summarize 
our findings. 
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Fig. 1. — Differen c es in M500 between clusters shared by the IVikhhnin et aLI l l2009al . V09), IMantz et all Il2010bl . MIO), and 
IPlanck Collaboration! l|2011bl . Pll-LS) samples. Filled circles are relaxed/cool-core clusters, while open circles are non-relaxed or non 
cool-core systems. The small horizontal lines along the y axis mark the average mass offset for each of the three cross-comparisons: MIO- 
V09 (sohd red), (P11-LS)-V09 (dashed blue), (Pll-LS)-MlO (dot-dashed green). For the (Pll-LS)-MlO comparison, wc show averages 
computed separately for low and high redshift systems, split at ^ = 0.13 (vertical dotted line). The low and high redshift averages are 
displaced on the lef t and right axes respectively. All averages are computed using only relaxed/cool-core systems. Clusters labeled are 
discussed in 13. 1.21 



3.1. Mass Comparison 

We begin by comparing the mass estimates for the sam- 
ples of galaxy clusters. As discussed above, each group 
calibrates mass-observable relations against hydrostatic 
mass estimates of relaxed systems, then uses these cal- 
ibrated observables to infer masses for the bulk of the 
objects in their samples. It is important to keep in mind 
that the list of galaxy clusters with direct hydrostatic 
mass estimates is relatively small, and some calibration 
clusters are yet published, so directly comparing the ac- 
tual hydrostatic mass estimates of individual galaxy clus- 
ters is difficult. Fortunately, comparing the masses de- 
rived from observational proxies is sufficient for our pur- 
poses. Indeed, consider two groups, A and B, that cali- 
brate the mass observables relations, Ma~Xa and Mb- 
Xb, using their hydrostatic mass estimates of relaxed 
galaxy clusters. The observables Xa and Xb need not 
be the same. By definition, the mass proxy M{X) for 
each group is 



lnMA=(lnM|XA) , 
lnMB = (lnM|XB), 



(1) 
(2) 



where the average is computed over the calibration sets of 
relaxed galaxy clusters by each group. It trivially follows 
that the average mass offset ( InMA(X^) — In Mb{Xb) ) 
between samples A and B is an unbiased estimator of 
the hydrostatic mass offset, so long as one only averages 
over relaxed galaxy clusters. This restriction is there be- 
cause the calibration of M(X) is done using only relaxed 



galaxy clusters. Consequently, we interpret the mass off- 
sets observed here as hydrostatic mass differences, even 
though the mass estimates themselves come from mass- 
observable relations. In Paper II, wc will make use of the 
mass offsets identified here when examining differences in 
published scaling relations. 

Figure [T] shows mass differences for clusters in common 
between the MI0-V09, (P1I-LS)-V09, and (PlI-LS)- 
MIO sample pairs. Mean differences computed using only 
relaxed/cool-core clusters (filled symbols) are listed in 
Table[3]and shown by short lines in the figure. The (Pll- 
LS)-V09 and M10-V09 mass comparisons show mod- 
est offsets of (A In A/) = -0.12 ± 0.02 and (AlnM) = 
0.08 ± 0.02 respectively. The (Pll-LS)-MlO mass offset 
using cool-core systems only is (A In M) = —0.35 ± 0.07, 
but the individual cluster values are sensitive to red- 
shift, with the largest discrepancies above z = 0.13. 
Splitting the sample at z = 0.13, we find mean offsets 
(AlnM) = -0.16 ± 0.07 and (AlnM) = -0.48 ± 0.07 
below and above this redshift, which we hereafter refer 
to as low and high redshifts, respectively. These values 
differ at the 3.2(7 level. 

While it would be useful to determine whether there 
is relative evolution in the other pairings ((P11-LS)-V09 
and M10-V09), the sample overlaps are too small to per- 
form a conclusive test. 

In Paper II, we note that the ix^Msoo relation of 
iPratt et al.l (|2009|) relies on the hyd rostatic mass esti- 
mates of iPointecouteau et al.l (|2005[ ) . The latter work 
has 5 clusters in common with V09. For this common set. 
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we find a mean mass difference (Alnil/) — —0.18 ±0.05 
(P05-V09), consistent with the -0.f2 ± 0.02 mean for 
the (P11-LS)-V09 sample pair. 

3.1.1. Non-relaxed or No Cool-Core Clusters 

Above, we restricted ourselves to relaxed/cool-core 
clusters. This ensures that the observed mass offsets are 
unbiased estimates of the hydrostatic mass offsets in the 
calibration samples. Nevertheless, it is interesting to look 
at the corresponding offsets for non-relaxed systems, as 
shown in Figure [T] (open symbols). 

The mean mass differences for non-relaxed clusters 
do not differ substantially from those of relaxed sys- 
tems. For the M10-V09 comparison, we find (AlnM) = 
0.22 ± 0.11, compared to 0.08 ± 0.02 for relaxed systems. 
For the (P11-LS)-V09 comparison, we find -0.14 ±0.03 
for non-relaxed versus — 0.12±0.02 for relaxed. For (Pll- 
LS)-M10, the non cool-core value for the full sample is 
—0.37 ± 0.05 compared to —0.35 ± 0.07 for cool-core sys- 
tems. The (Pll-LS)-MlO mass offset does not include 
cluster A2034 since it is compromised by a poor redshift 
estimate in Pll-LS (see section [3T2|) . 

3.1.2. Specific Clusters 

In the M10-V09 comparison, Abell 3667 Abell 2163 
stand out as having large offsets, A In A/ ^ 0.5. Both 
of these clusters are well-known merging systems. The 
X-ray morphology of A 3 667 displays a cold fro nt indica- 
tive of a recent merger (jVikhlinin et al.l[2001| ). and the 
cluster displays diametrically op posed radio relics drive n 
by outgoing post-merger shocks (jRottgering et al.lll997[ ). 
Analyzing ro ughly 500 spectrosc opically-confirmed clus- 
ter members. lOwers et al.l()2009D identify multiple galaxy 
concentrations, the dominant pair of which align with the 
merger axis seen at X-ray and radio wavelengths. 

A 2163 is a complex system that involves an East- West 
merger. There is also an additional northern component 
lying at nearly the same redshift as the main body, but 
thoug ht to be physically distinct ([Maurogordato et"al] 
12008). A 2163 also has a cold front and radio halo 
(jBourdin et al.l[2011| ) , and a multi- modal pr ojected mass 
map derived from gravitational lensing (jOkabe et al.l 

EonD. 

The clusters A 1763 and A 2261 exhibit the largest 
discrepancy in the (Pll-LS)-MlO comparison. A 
1763 is the dominan t member of a binary supercluster 
(jBiviano et al.l [20 111 ). A 2261 has been recently inves- 
tigated with s trong and weak lensing analysis by the 
CLASH team ()Coe et al.ll2012D , who produce several es- 
timates under different modeling conditions. Their main 
result corresponds to M500 = 13.4 x 10^^ Mq, which fa- 
vors the higher value of 14.4x IO^^Mq given by MIO. The 
shape and orientation of A2261's host halo, along with 
projection of surrounding large-scale structure, are iden- 
tified as key sources of systematic uncertainty in their 
lensing analysis. They suggest a worst-case scenario in 
which there is a 2:1 axis ratio along the line of sight, re- 



sulting in a lower mass. Mi 



500 



9.4 X lO^'' Mq. This 
value is still closer in log to the MIO mass than it is to 
the Pll-LS mass of M500 = 6.4 x 10^'' Mq. We also label 
the cluster A 2034 because it stands out as a clear out- 
lier in the Lx and Mgas comparisons of Pll-LS and MIO. 
This can be traced to an incorrect redshift assignment in 
Pll-LS. 



3.2. Lx Comparison 

We follow iPianck Collaboration! (j2011bl ) in defining Lx 
as the X-ray luminosity within a cylindrical aperture 
i?50o in the 0.1 — 2. 4 keV rest frame band . Th is definition 
is also adopted bvlMantz et al.l (l2010bD and iPratt et aLl 
()2009D . whereas Ivikhlinin et al.l ()2009aD defines Lx in 
the 0.5 — 2.0 keV band and integrates within a fixed, 
2 Mpc aperture. We convert the Vikhlinin luminosi- 
ties to the 0.1 - 2.4 ke V band by muhiplying by 1/0.62 
(jBohringer et al.l |2004[) , and then again by a factor of 
0.96 to rescale to luminosities within i?5ooE3 Because 
the mass calibration is different for each of the three 
works, we should in principle correct the luminosities 
within i?5oo to a common aperture. In practice, how- 
ever, the dependence of Lx on aperture is sufficiently 
weak that the corresponding systematic offsets are com- 
pletely negligible (see Appendix IB|) . 

Figure i shows our M10-V09, (P11-LS)-V09, and 
(Pll-LS)-MlO Lx comparisons, as labelled. There 
is one gross outlier in this comparison: Abell 2034, 
which we do not include when estimating the offset be- 
tween the various works. After outlier removal, we find 
(AlnLx) = -0.01±0.02for the (P11-LS)~V09 compari- 
son, 0.12 ±0.02 for M10-V09, and -0.11 ±0.02 for (Pll- 
LS)-M10. A similar cluster-by-cluster comparison of Lx 
for high-redshift systems between MIO and V09 galaxy 
clusters shows no evidence of offset in Lx (Mantz, private 
communication), suggesting that the V09-M10 difference 
observed here may be due to the treatment of ROSAT 
data. Table [3] summarizes the mean offset values. 

A2034 is an outlier because different redshifts are em- 
ployed by Pll-LS (z = 0.151) and MIO (z 0.113). The 
difference in redshift is Az/z w 0.3, which induces an er- 
ror A In Lx « 0.6. Examination of spectroscopic galaxies 
in the SDSS makes it apparent that the lower of the two 
redshifts is the better value; we find z = 0.1137 ± 0.0008 
using 31 galaxies with spectroscopic redshifts within 5 
arcmin of the central galaxy. Figure [2] shows in purple 
the luminosity offset for A2034 after correcting for the 
difference in luminosity distances to this object (ignor- 
ing the corresponding k-correction). 

3.3. Mgas Comparison 

The top panel in Figure [3] shows differences in pub- 
lished values of Mgas, the gas mass within a sphere of 
radius i?5oo, between the various works. These values 
are not yet corrected to a common apertureQ Abell 
2034 is again an outlier, so we do not include it in any 
of our calculations. Abeh 2261 (A In Mgas = -0.41) and 
Abell 1763 (A In Mgas = —0.37) appear to have unusu- 
ally low A In Mgas values in the (Pll-LS)-MlO compar- 
ison. However, using the absolute median deviation to 
estimate a Gaussian width in the presence of outlieriFl. 
we find that these clusters are 2.7(t and 2.4(t away from 

This correction factor was estimated independently by both A. 
Vikhhnin and A. Mantz (private communi cation). It is somewha t 
higher than the 0.91 factor advocated bv IPiliaretti et al] II2011I ). 
likely reflecting the f act that they in t egrate their model out to 
S-Rsoo, larger than the I Vikhlinin et al.l l l2009al ) scale. 

IVikhlinin et al.l Il2009al) does not report Mgas directly, but 
rather M500 as estimated from Mgas. We use the reported /gas-M 
relation of that work to recover the original Mgas values. 

^•^ For a Gaussian distribution, the median absolute deviation d 
is related to the standard deviation via a = 1.4826d. 
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Fig. 2. — As Figurc[T] but now comparing X-ray luminosity, Lx, 
for clusters in pairs of samples. The shifted A2034 point shows 
the Lx difference after correcting the Pll-LS value to the MIO 
redshift. 



zero. Consequently, we do not consider them outliers. 

After removing Abell 2034, the M10-V09 and (Pll- 
LS)-V09 mean offsets are (AlnMgas) = 0.08 ± 0.02 and 
-0.02 ±0.02 respectively. The (Pll-LS)-MlO compari- 
son reveals there is redshift evolution in the mean offset 
between the two works that mirrors the total mass be- 
havior of section [Sm with (A In A/gas) = -0.10 ± 0.04 
and —0.23 ± 0.04 at low and high redshift respectively. 
As we now show, these differences primarily reflect the 
differences in apertures induced by the mass differences 
between the three works. 

As detailed in Appendix [B] a bias 6a/ in total mass 
induces a bias in the gas mass within i?5oo that roughly 
scales as b^;^. To estimate Aigas using a common aperture 
for each cluster, we subtract the geometrically induced 
offset AlnAfgas = 0.4AlnAf from the observed values. 
This correction is applied on a cluster-by-cluster basis. 

The bottom panel in Figure [3] shows the Afgas compar- 
ison after applying this systematic aperture correction. 
The corresponding (AlnAfgas) offsets, listed in Tabled 
are 0.03 ± 0.02 (M10-V09), 0.03 ±0.02 ((P11-LS)-V09), 
-0.02 ± 0.03 ((Pll-LS)-MlO, low z), and -0.04 ± 0.02 
((Pf f-LS)-Mf 0, high z). There is no evidence of sys- 
tematic offsets after the aperture correction is applied, 
demonstrating that A^gas is a quantity that is robustly 
measured by the groups. This finding will play a key 
role in our discussion in section 21 

3.4. Tx Comparison 

We define Tx as the spectroscopic temperature es- 
timated within a cylindrical annulus of radius R G 
[0.15, f .0]i?5oo- This matches the Mantz and Vikhlinin 
definition, but differs from that of Planck, who use an 
annulus of radius R G [0.15, 0.75] i?5oo- We must cor- 
rect for this difference in aperture before we perform our 
comparison. To do so, we use temperature estimates 
within both radial scales for the REXCESS sample of 
iPratt et all (|2009D . Their data imply a modest correc- 
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Fig. 3. — As Figure [l] but now comparing A/gas. The top panel 
shows the raw A^fgas offsets, while the bottom panel accounts for 
the difference in aperture Rsoo coming from the systematic mass 
offsets. Cluster Abell 2034 is excluded from the e stimate of the 
mean A/gas offset as per the discussion in section r3.2l Clusters Abell 
2261 and Abell 1763 exhibit the largest A^gas offsets, but are not 
statistical outliers in A/gas after applying the aperture corrections. 

tion of the form 

rx([0.15, l]i?50o) ^ / rx([0.15,0.75]i?5oo) y "' 
rx([0.15,0.75]i?5oo) ■ V 5.0 keV J 

(3) 

We use this relation to correct all of their Tx values to a 
[0.15, l]i?5o o apertu re. As we were completing this work, 
iRozo et"al1 ()2012aD argu ed that the t emperature correc- 
tions calibrated from the IPratt et al.l (2009) data are not 
correct, and that one incurs a greater error by applying 
this correction than by simply ignoring the difference in 
the definition of Tx- Using deeper XMM-Newton data, 
Pratt (private communication) estimates the tempera- 
ture ratio at 0.97 rather than 0.95. We will ignore this 
small difference in our discussion, and simply note that 
the temperature correction we apply is likely too large 
by « 2% - 3%. 

There are other slight differences in the temperature 
definitions between the three groups that we do not at- 
tempt to account for. The spectral energy range used to 
derive X-ray temperatures are subtly different: [0.6-10], 
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Fig. 4. — As Figure[T] but now comparing Tx. There is no MIO- 
V09 comparison because there are only two clusters in the o verlap 
sample with temperature estimates from lMantz et all I l2010bl ) . The 
(Pll-LS)-MlO comparison is limited to the subset of 12 c lusters 
that have temperature estimates from lMantz et all Il2010bl) . 

[0.8-7.0], and [0.3-10] kc V for V09, MIO, and PH-L S, re- 
spectively. Si mulations (iMathiesen fc Evrardll200lD and 
observations (jCavagnolo et al.ll2008D suggest that derived 
temperatures depend on bandpass, with harder spectral 
ranges yielding generally higher temperatures. However, 
we expect these effects should be small for the subtle 
bandpass differences between V09, MIO and Pll-LS. In 
addition, in principle, we should also correct all temper- 
atures to ensure a consistent choice of i?5oo- In practice, 
however, the dependence of Tx on i?5oo is weak enough 
that it may be safely neglected (see Appendix [B|) . 

Figure 12 shows the (P11-LS)~V09 and (Pll-LS)-MlO 
temperature comparisons. The mean offsets of —0.13 ± 
0.02 and -0.14 ± 0.05 respectively, are listed in Tabled 
In the (Pll-LS)-MlO comparison we limit ourselves to 
the subsamplc of 12 galaxy clusters that have indepen- 
dently estimated temperatures. We do not show a MIO- 
V09 comparison because only 6 of the 16 clusters in the 
overlap sample have independ ent temperatu re measure- 
ments, and 4 of those are from iHorneH ()2001f ). which are 
neither core nor clump excised. The cluster RX-J0232.2, 
observed by Pll-LS and MIO, exhibits the largest dis- 
crepancy, but the large error in the MIO temperature of 
Tx = 10.06 ± 2.31 keV means that this difference is not 
significant. 

Differences similar to the 13% offset we find be- 
tween Chandra and XMM-Newton temperatures have 
been noticed before, and are typically a ttributed to in- 
strument calibration ii ncertainties (e.g. iVikhlini n et al. 
I2005t IPratt et all 120091) . For example . iNevalainen et al. 
( 2010[ ) find instrumental calibration can lead to system- 
atic oflFsets AlnT = -0.08 (ACIS/MOSl) or AlnT = 
-0.14 (ACIS/MOSl and ACIS/M0S2). Interpretation 
of our results as temperature calibration offsets, how- 
ever, is not t rivial, as the C h andra calibration version 
emplove d bv lyTkhlinin et al.l (I2009af ) is different from 
that of INevalainen et al.l ()2010D " We note also that 
INevalainen et al.l ()2010[) employ a spectral bandpass of 
[0.5, 7]keV, slightly different from the choices of the other 
three groups. 
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Fig. 5. — As Figure [l] but now comparing Yx- The gas mass has 
been corrected for the difference in the aperture -R500 based on the 
results from Figure [T] 

In addition, it is also clear that differences in substruc- 
ture masking plays a role in at least a subset of the 
galaxy clusters under consideration. In particular, the 
two largest outliers in the (P11-LS)-V09 comparison are 
Abell 3376 and Abell 2256. Both of these systems are 
known to have large, cool substructures, which the final 
temperature estimates are sensitiv e to. A comparison of 
the masks used to estimate Ty inlPlanck Collaboration! 
(|2011bD and IVikhlinin et al.l (|2009a[) reveals that there 
the two masks are different (A. Vikhlinin and H. Bour- 
din, private communication). A reanalysis of the X-ray 
data of these two clusters using the same masks results 
in significantly better agreement between the two works 
(A. Vikhlinin and H. Bourdin, private communication). 

3.5. Yx Comparison 

The X-ray gas thermal energy, Yx, is defined as the 
product Yx = MgasTx- Figure [B] compares the Yx esti- 
mates of galaxy clusters between the various works. We 
have again corrected the IPlanck Collabor ation' (' 2011bfl 
temper atures to match the definition of IVikhlinin et alj 
(|2009aD . and we have corrected the Mgas values to a com- 
mon metric aperture as per section 13.31 We do not in- 
clude a M10-V09 comparison, since there are only two 
clusters with independent core-excised temperature mea- 
surements. 

We find (Alnyx) = -0.15±0.03 for the (P11-LS)-V09 
difference and -0.19±0.05 for (Pll-LS)-MlO. The (Pll- 
LS)-M10 offset is essentially in perfect agreement with 
the expectation that (AlnYx) = (AlnMgas)-h(AlnTx), 
whereas for the (P11-LS)-V09 comparison, we have 
(AlnMgas) + (AlnTx) = -0.10 ± 0.03, which suggests 
the Afgas and Tx offsets of Pll-LS and MIO are corre- 
lated. The most discrepant system is again RX-J0232.2, 
but the error in the X-ray temperature in MIO for this 
system is large. 

3.6. Summary 

Table |3| summarizes the mean logarithmic offsets for 
each of the cluster properties considered in this work, 
with errors in the mean values derived from bootstrap 
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TABLE 3 

Mean Log Differences in X-ray Properties for. Sample Pairs 



Property 


MIO - V09 


Pll-LS - V09 


Pll-LS - MIO 
Low X [z < 0.13) 


Pll-LS - MIO 
High z > 0.13) 




0.12 ±0.02 


-0.01 ±0.02 


-0.12 ±0.02 


-0.10 ±0.03 




0.03 ± 0.02 


0.03 ±0.02 


-0.02 ±0.03 


-0.04 ±0.02 






-0.13 ±0.02 




-0.14 ±0.05 


Y^ab 




-0.15 ±0.03 




-0.19 ±0.05 




0.08 ± 0.02 


-0.12 ±0.02 


-0.16 ±0.07 


-0.48 ±0.07 




0.22 ±0.11 


-0.14 ±0.03 


-0.25 ±0.03 


-0.45 ±0.06 



^Offset computed after outlier removaL 
''Offset computed after correction to a common aperture. 
'^Relaxed/cool core only. 
Non-relaxed/no cool core only. 



resampling of the clusters in common between each pair 
of samples. 

We find significant systematic offsets in total mass esti- 
mates between the various groups, being as large as 20% 
at low redshift, and growing to « 45% at z « 0.2 for the 
(Pll-LS)-MlO comparison. We re-emphasize that this 
comparison necessarily rests on mass proxies (rescaled 
observables) rather than direct hydrostatic mass esti- 
mates. However, we have argued that this mass off- 
set should offer an unbiased estimate of the hydrostatic 
masses applied by the different groups, since each uses 
hydrostatic masses to calibrate their relevant observable- 
mass scaling relation. Future work comparing explicit 
hydrostatic masses across different groups is highly de- 
sirable. 

After applying an aperture correction based on total 
mass estimates, we find that all groups are in good agree- 
ment when it comes to estimating Mgas- The level of 
systematic differences in the mean are only a few per- 
cent, consistent with the overall systematic uncertainty 
between Chandra and XMM-Newton cross-calibration in 
the soft X-ray band, considered to be ~ 10% in flux or 5 % 
in Mgas (jNevalainen et al.ll2010HTsuiimoto et al.ll2011[ ). 

The good agreement in Afgas is not entirely reflected 
in the Lx comparison, where we find 12% systematic 
offsets in the M10-V09 and low redshift (Pll-LS)-MlO 
comparisons. These offset are not simply due to in- 
strumental calibration, since otherwise one should have 
A In Mgas = O.SAlnLx, which is not satisfied. Detailed 
comparative studies, in which the same cluster observa- 
tions are independently analyzed by different groups, are 
needed to clarify the origins of the Lx differences found 
here. 

Consistent wit h prev ious calibration studies such as 
iNevalainen et al.l (|2010( ). we find ~ 13% differences in 
mean temperature estimates of clusters. The principal 
source of these offsets is not clear: it could reflect in- 
strumental calibration, or differences in the data analysis 
(e.g. substructure masking). Differences in gas thermal 
energy, Yx, are consistent with the product of the differ- 
ences in Mgas and Tx for the (Pll-LS)-MlO comparison, 
but there may be differences in the (P11-LS)-V09 com- 
parison. This difference may be evidence that the Mgas 
and Tx systematic offsets are correlated. 

4. REDSHIFT BEHAVIOR OF (Pll-LS)-MlO ESTIMATES 

In the analysis of M500 above, the (Pll-LS)-MlO val- 
ues display redshift sensitivity, with larger discrepancy 
for z > 0.13. A detailed discussion of possible inter- 



pretations of this result is presented in Appendix [Cj 
Here, we present only a summary of the discussion, since 
the details are cumbersome. We first verity that the 
evolution in the mass ratio is significant by fitting a 
power-law model AlnM oc 7ln(l + z), which requires 
7 = —1.90 ± 0.44, indicating evolution at high confi- 
dence. We attempt to ascertain which of the two data 
sets is driving this evolution. Specifically, we first assume 
that all evolution is due to the MIO data set, and then 
consider the observational implications of this hypothe- 
sis. We then consider the converse hypothesis — that all 
evolution is due to systematics in the Pll-LS data set — 
and consider its observational implications. We are not 
able to rule out either hypothesis, but we present the ba- 
sic results and suggest further work to clarify the source 
of the discrepancy. Of course, the true answer may lie 
between the two simple extremes posed here. 

Suppose that the observed evolution is due solely to 
systematics in the MIO data set. MIO adopted a con- 
stant /gas model, deriving M^qq from Mgas by assuming 
a fixed gas fraction value, fo- Since all groups agree on 
Afgas(i?) within a common aperture, the only problem 
that can arise with the MIO masses is if their constant 
/gas assumption is incorrect. Given an arbitrary /gas 
model, the bias induced by assuming /gas = fo is 



AlnM = 1.67 In 



UUM,z) 
fo 



(4) 



where the value of 1.67 arises from the aperture correc- 
tion discussed in § 13.31 We have verified that if one uses 



an /gas model derived from the Pll-LS data, one can re- 
produce the mass-offset behavior seen in Figure [TJ Note 
in replacing the /gas model in this way, the agreement in 
M500 simply refiects the agreement in Mgas between the 
two works. 

The redshift and mass dependence of the gas mass frac- 
tion i s an active topic of investigation. Most works to 
date (jVikhlinin et al.ll2009at iSun et al.l[200l iPratt et al.l 
12001 argue that the gas mass fraction /gas oc M"g where 
Ola » 0.1 — 0.2. In addition, a recent study by iLin et al.l 
(|2012D argue not only for mass scaling {ag ~ 0.13), but 
also redshift scaling. On the other hand, [A llen et alj 
(j2008| ) — on which the work of iMantz et"aIT (,2010b) is 
based — find no scaling of /gas with mass and / or redshift 
within the range probed by their data. In short, whether 
or not the MIO masses are correct depends on the slope 
of the fgns-M relation, and there are plausible models 
for the feas-M relation that can induce bias in the MIO 
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masses. Whether such a bias is real or not, however, re- 
quires a better understanding of the gas mass fraction 

/gas(M,z). 

We now consider the alternate proposition, namely 
that the Pll-LS masses are biased. The good agreement 
in Mgas between MIO and Pll-LS suggest that the total 
mass bias should be sourced primarily by cluster temper- 
ature. Thus, if the MIO masses are unbiased, the evolving 
bias in the (Pll-LS)-MlO comparison implies an evolv- 
ing temperature bias in the Pll-LS temperatures. We 
search for such an evolution by considering an effective 
"SZ temperature", Tsz = ^sz /A/gas, and examine the 
ratio Tsz/T^ = >sz/>x in the Pll-LS data using the 
published Fsz values derived from Planck satellite ob- 
servations. We find that the Fsz/^x ratio of Pll-LS 
exhibits redshift dependence at the 2.7a level, suggestive 
but not definitive. We further demonstrate that such an 
evolution — if real — is not sourced by selection effects 
(at least in the form of S/N cuts) , but would have to be 
endemic to the cluster population. 

We also note in Appendix [Cl that the mean ln(lsz/^x) 
value for z < 0.13 clusters is slightly negative (—0.06 ± 
0.04) while the value for z < 0.13 clusters is slightly pos- 
itive (0.08 ± 0.04). The redshift transition lies in the 
range that divides th e A an d B subsets of clusters in 
iPlanck Collaboration! (|2011bf ). These samples are char- 
acterized as having i?5oo values that subtend angular 
scales greater than or less than 12', respectively, where 
12' is the maximum angular scale within which the X-ray 
background can be determined in a single XMM-Newton 
field of view. Thus, it is possible that the redshift evolu- 
tion we observe simply reflects systematic differences in- 
herent to the A and B cluster populations. The most ob- 
vious possibility is background subtraction, as the back- 
ground treatment for A and B clusters i s differe nt. That 
said, as noted bv IPlanck Collaboration! (j2011bD . the set 
of clusters in Pll-LS is neither representative nor com- 
plete, so we heed that caution, and leave it to future work 
to further pursue the origin of the systematic difference 
between A and B clusters in Pll-LS. 

We note, however, that the ratio >sz/>x = 1.08±0.04 
seen above z = 0.13 is unusually h igh. This result in 
in strong (6.3(t) tension with that of !Rozo et al.! ()2012a|) 
for z w 0.1 galaxy cluster using Planck and Chandra 
data. What is most surprising about this result, how- 
ever, is that it appears to be in conflict with X-ray ex- 
pectations not only at the quantitative but even at the 
qualitative level. Specifically, galaxy clu sters are known 
to have falling temperatu re profiles (e.g !Vikhlinin et al.l 
!2006t !Arnaud et aLl!2010D . Compared to SZ measure- 
ments. X-ray temperature weigh the inner, hotter (re- 
member these are core-excised temperature) regions of a 
galaxy cluster more heavily than the outskirts. This in 
turn leads one to a generic prediction that Ya^i/Yx < 1, 
a prediction that is only strengthened by the presence of 
unresolved gas clumping. It seems difficult to reconcile 
this X-ray expectation to the Isz/^x = 1-08 ± 0.04 seen 
above z = 0.13. 

5. SUMMARY AND CONCLUSIONS 

We compare recent estimates of cluster properties 
within i?5oo published by three independent groups 
based on Chandra and XMM-Newton data. Using 
clusters in common to pairs of studies, we form ratios 



of the estimated total masses, gas masses, temperatures 
and gas thermal energies for individual clusters. Mean 
values of these ratios are used to gauge the level of 
systematic error in X-ray analysis. Our findings are 
summarized as follows. 

• Total masses derived from scaled observables differ 
at the 10% to nearly 50% depending on the two works 
being compared, and the redshift range sampled. These 
differences exist even for relaxed and cool-core clusters. 

In the (Pll-LS)-MlO comparison of 16 systems at 
z > 0.13, total masses deviate by 48% ± 7% in the mean, 
with A2261 and A1763 having individual mass estimates 
that differ by more than a factor of two between the two 
groups. Similar d ifferences have been noted before (see 
iRvkoff et al.!!20lM Appendix B), but this work is the first 
to attempt to quantify these systematic offsets between 
statistically relevant clusters samples drawn from the lit- 
erature. As we demonstrate in paper II, these differences 
are the primary source of tension between the cluster 
scaling relations derived by the three groups we consider. 

• -A^gas within a common scaled aperture is a quantity 
that is robustly estimated. 

If the behavior of the gas mass fraction, /gas(M, z), 
were known, this result suggests Mgas would be the 
most observationally robust X-ray mass proxy. There 
remains debate as to the gas fraction behavior. While 
most studies in the literature find that /gas scales with 
mass as M°-^-° - ^ (!Vikhlinin et al! !2009at !Pratt et al.l 
[20091: !Sun et al.l !2009l: !Lin et al.! !2012! ). there is also 
evidence that the scaling saturates t o a constant valu e 
for the most massive galaxy clusters ([Allen et al.l!2008| ). 
Full convergence on a unique solution has not yet been 
reached, and current differences may simply reflect 
overall differences in the mass range sampled by the 
various galaxy cluster samples. 

• There are 5% — 15% systematic differences in Lx, 
Tx and Yx between the various groups. 

The deviations in Yx are consistent with the product 
of the Mgas and Tx deviations in the (Pll-LS)-MlO 
comparison, but less so for the (P11-LS)-V09 compar- 
ison, suggesting correlated offsets. The primary source 
of these disc repancies is uncle a r: ne it her instrumental 
calibration ([Nevalainen et all [20101 : !Tsuiimoto et al.l 
[20T1I) nor methodological differences can currently be 
ruled out, and both appear to contribute at some level. 
Resolving this problem requires a concerted program to 
explicitly test data reduction pipelines, using both real 
and simulated data, in a controlled environment. 

• There is apparent redshift evolution in the relative 
(Pll-LS)-MlO mass calibration, with strong tension 
between the two works at z > 0.13. 

We show that modifying the [Mantz et all ([2010bl ) 
masses to incorporate a mild trend in /gas with mass 
can account for the differences seen, but an alternative 
scenario involving redshift evolution in Pll-LS mass es- 
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timates is also possible. In the latter case, the Y^z/Yy^ 
ratio should also evolve with redshift, and such evolution 
is in fact seen at modest {2.7a) significance. The average 
value of Isz/ix = 1.08 ± 0.04 for z > 0.13 galaxy clus- 
ters is high co mpared to recent Chandra measurements 
of 0.82 ± 0.02 (jRozo et al.ll2012a[ ). and difficuh to recon- 
cile with the well-established falling temperature profiles 
of galaxy clusters. 

X-ray-based cluster methods have a long and storied 
history as cosmological probes, and the cosmological con- 
straints from cluster samples today are competitive with, 
and complementary to, the best large-scale structure 
methods available. Even if weak lensing methods of mass 
calibration eventually become more robust, thanks to 
low-scatter mass proxies such as Mgas and Ix, X-ray ob- 
servations are guaranteed to remain invaluable in helping 
improve cosmological constraints in future cluster sam- 
ples. Realizing the full promise of the cluster population, 
however, requires that the systematic differences high- 
lighted in this work be fully resolved. 
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APPENDIX 
A: SAMPLE COMPARISON DATA 

The data employed in our comparisons is presented here. Table S] lists clusters common to the M10-V09 samples, 
Table[S]the (P11-LS)-V09 samples, and TablclHlthe (P11-LS)~M10 samples. As per the discussion in section [TTTl we 
report only central values, since the quoted uncertainties cannot explain the systematic shifts in most properties. Data 
compi l ed in these tables arc raw values extracted from the relevant papers (|Vikhlinin et al.|[200l [2009at IMantz et al.l 
I2010bt iPlanck Gollaboration 20lTbh . and before applying any of the corrections described in the text. 

Units are as follows: Lx (W* ergs/s), Mgas (lO" Mq), Tx (keV) , ( 10^^ keVMo), C-^D\Y^-z (lO" keVMo). 
We assume C = 1.416 x Mpc^eV^Mg ^ as per lArnaud eFall (fMot) . 

B: CLUSTER PROFILES AND APERTURE CORRECTIONS 

We consider the mean radial profile of three quantities: gas density, Pe a s(R), electron temperature T{R), and 
pressure P{R)- We use x = R/R^oo- For pgas, we use the iPratt fc ArnaudI (|2002[) parameterization as reported by 
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TABLE 4 

M10-V09 Cluster Sample 



Name 




2 


r Vik 


T Mantz 


gas 


i\/r Alantz 


JlfVik 


T\/rAIantz 
"^500 


ASS 


0, 


.0557 


2.91 


5.70 


0.69 


0.82 


5.98 


7.20 


A401 





.0743 


3.90 


6.80 


1.15 


1.16 


8.63 


10.10 


A478 





.0881 


7.24 


13.30 


0.95 


1.15 


8.15 


10.10 


A1795 





.0622 


3.52 


6.00 


0.61 


0.63 


5.46 


5.50 


A2Uzy 





.0779 


5.72 


10.60 


1.01 


1.16 


8.64 


9.30 


A2142 


0, 


.0904 


7.20 


12.40 


1.52 


1.59 


11.96 


13.90 


A2204 


0, 


.1511 


9.35 


17.90 


1.15 


1.18 


9.40 


10.30 


A2244 


0, 


.0989 


2.98 


5.20 


0.67 


0.70 


5.11 


6.20 


A2597 


0, 


.0830 


2.09 


4.30 


0.32 


0.33 


2.84 


2.90 


A3112 


0, 


.0759 


2.43 


4.40 


0.46 


0.47 


4.20 


4.10 


RX J1504 


0, 


.2169 


15.60 


27.60 


1.11 


1.25 


10.07 


11.00 


A2163 


0, 


.2030 


13.70 


28.70 


3.36 


4.40 


21.98 


38.50 


A2256 


0, 


.0581 


2.66 


4.60 


0.72 


0.82 


7.85 


7.20 


A3266 


0, 


.0602 


2.69 


4.90 


0.92 


1.06 


9.00 


9.20 


A3558 


0, 


.0469 


1.96 


3.70 


0.63 


0.73 


4.78 


6.40 


A3667 


0, 


.0557 


3.14 


5.80 


1.06 


1.35 


7.35 


11.80 



Note. — Units are 10** ergs/s for Lx, 10^* Mq for mass. Precise definitions 
specified in tfie text. The list is divided by tlie blank row into relaxed (upper) and 
merging (lower) systems. Values listed are raw data drawn from tfie literature, 
before applying any of the corrections mentioned in the main body of the text. 



TABLE 5 

(P11-LS)-V09 Cluster Sample 



Name 




z 


J PI 


r Vik 




j^^Vik 
gas 


rpPl 

-'x 


rpV ik 
^X 


^x 


vVik 
^X 


^"500 






A85 


0.0557 


4.65 


2.91 


0.66 


0.69 


5.78 


6.45 


3.81 


AAA 


5.30 


5.98 


3.32 


A401 


0.0743 


5.82 


3.90 


1.02 


1.15 


7.26 


7.72 


7.41 


8.52 


7.65 


8.63 


5.86 


A478 


0.0881 


12.33 


7.24 


1.06 


0.95 


6.43 


7.96 


6.82 


7.74 


7.23 


8.15 


6.50 


A1413 


0.143 


3.39 


4.11 


0.53 


0.81 


6.59 


7.30 


3.49 


6.93 


4.90 


7.57 


4.87 


A1650 


0, 


.0823 


3.79 


2.33 


0.51 


0.54 


5.11 


5.29 


2.61 


2.82 


4.22 


4.59 


3.11 


A1651 


0, 


.0853 


4.23 


2.93 


0.56 


0.64 


5.23 


6.41 


2.93 


4.23 


4.51 


5.78 


2.54 


A1795 


0, 


.0622 


5.90 


3.52 


0.73 


0.61 


6.60 


6.14 


4.82 


3.80 


5.96 


5.46 


3.25 


A2029 


0, 


.0779 


10.00 


5.72 


1.12 


1.02 


7.70 


8.22 


8.62 


8.55 


8.30 


8.64 


5.72 


A2204 


0, 


.1511 


15.73 


9.35 


1.09 


1.15 


7.75 


8.55 


8.45 


10.17 


8.04 


9.40 


7.84 


A2390 


0, 


.2329 


17.20 


10.49 


1.54 


1.51 


8.89 


9.40 


13.69 


13.84 


10.35 


11.02 


11.72 


A3112 


0, 


.0759 


3.84 


2.43 


0.40 


0.46 


5.02 


5.19 


2.01 


2.40 


3.67 


4.20 


1.27 


A3158 


0, 


.0583 


2.66 


1.72 


0.53 


0.54 


5.00 


4.67 


2.65 


2.33 


4.29 


4.13 


2.47 


Zwl215 


0, 


.0767 


2.88 


1.80 


0.63 


0.61 


6.45 


6.54 


4.06 


4.18 


5.45 


5.75 


3.25 


A1689^ 


0, 


.1832 


13.29 


7.20 


1.08 


1.13 


8.17 


8.85 


8.82 


9.57 


8.19 


9.02 


9.68 


A119 


0, 


.0445 


1.52 


1.06 


0.45 


0.39 


5.40 


5.72 


2.43 


2.69 


4.12 


4.50 


1.91 


A754 


0, 


.0542 


4.68 


2.78 


1.04 


0.79 


8.93 


8.73 


9.29 


8.19 


8.69 


8.47 


6.07 


A1644 


0, 


.0475 


1.66 


1.14 


0.41 


0.53 


4.86 


4.61 


1.99 


2.48 


3.66 


4.29 


1.77 


A2065 


0, 


.0723 


3.20 


1.82 


0.60 


0.56 


5.36 


5.44 


3.22 


3.24 


4.78 


4.98 


2.75 


A2163 


0, 


.2030 


23.86 


13.70 


3.17 


3.36 


13.40 


14.72 


42.48 


45.99 


19.68 


21.98 


1.48 


A2256 


0, 


.0581 


3.92 


2.66 


0.78 


0.72 


6.40 


8.37 


4.99 


7.16 


6.11 


7.84 


5.01 


A3266 


0, 


.0602 


4.22 


2.69 


0.96 


0.92 


7.46 


8.63 


7.16 


9.12 


7.51 


9.00 


6.36 


A3376 


0, 


.0455 


0.92 


0.59 


0.28 


0.26 


3.39 


4.37 


0.95 


1.33 


2.39 


3.01 


0.71 


A3558 


0, 


.0557 


3.54 


1.96 


0.67 


0.63 


4.78 


4.88 


3.20 


2.99 


4.77 


4.78 


2.97 



Note. — Units are 10** ergs/s for Lx, 10^* Mq for mass, keV for temperature, and 10^* MQkeV for Yx and 
D^Ygz- Precise definitions of all quantities are specified in the text. The list is divided by the blank row into 
relaxed (upper) and merging (lower) systems. 

'^Though relaxed in appearance, A1689 is known to have several structures superimposed along the line of sight 
HLokas et al.ll2006h . 



iPiffaretti etall ((Ml . 



X f a; ^ -3^!/2+a/2.0 

Pgas{x) OC ( I I 1 



^ Xc J \ ^ 

with Xc = 0.303, a = 0.525, and /3 = 0.768. The temperature profile is parameterized bv lVikhlinin et all (j2006D as 

0.45 + r 1 



(Bl) 



T{x) OC 



1 + r l + (a;/0.6)2 



(B2) 
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TABLE 6 
(Pll-LS)-MlO Cluster Sample 



Name 


z 


J PI 


r A/ antz 


MP' 


-'''-'gas 


rpPl 

-'x 


rpA'I antz 

-'x 


vPi 
^x 


\^ A'l ant z 


*-'50() 


i\ rAIantz r^ — 


■^D\Y^'L 


A85 


0.0557 


4.65 


5.70 


0.66 


0.82 


5.78 


— 


3.81 


— 


5.30 


7.20 


3.32 


A478 


0.0881 


12.33 


13.30 


1.06 


1.15 


6.43 


— 


6.82 


— 


7.23 


10.10 


6.50 


A963 


0.206 


6.40 


6.50 


0.66 


0.78 


5.49 


6.08 


3.62 


4.74 


4.95 


6.80 


2.90 


A1689 


0.1832 


13.29 


13.60 


1.08 


1.21 


8.17 


— 


8.82 


— 


8.19 


10.50 


9.68 


A1795 


0.0622 


5.90 


6.00 


0.73 


0.63 


6.60 


— 


4.82 


— 


5.96 


5.50 


3.25 


A2029 


0.0779 


10.00 


10.60 


1.12 


1.07 


7.70 


— 


8.62 


— 


8.30 


9.30 


5.72 


A2261 


0.224 


9.97 


12.00 


0.93 


1.65 


6.23 


6.10 


5.79 


10.07 


6.41 


14.40 


8.33 


A2390 


0.2329 


17.20 


17.30 


1.54 


1.73 


8.89 


10.28 


13.69 


17.78 


10.35 


15.20 


11.72 


A3112 


0.0759 


3.84 


4.40 


0.40 


0.47 


5.02 


— 


2.01 


— 


3.67 


4.10 


1.27 


RX J0232 


0.2836 


12.53 


13.30 


1.07 


1.45 


6.41 


10.06 


6.86 


14.59 


6.95 


12.70 


6.07 


RX J0528 


0.3839 


10.55 


11.60 


1.11 


1.52 


6.04 


7.80 


6.70 


11.86 


6.88 


13.30 


8.33 


A401 


0.0743 


5.82 


6.80 


1.02 


1.16 


7.26 


— 


7.41 


— 


7.65 


10.10 


5.86 


A520 


0.203 


7.11 


8.40 


1.13 


1.37 


7.74 


7.23 


8.75 


9.91 


8.11 


11.90 


6.99 


A665 


0.1818 


6.81 


8.60 


1.12 


1.46 


7.64 


— 


8.56 


— 


8.04 


12.70 


7.70 


A773 


0.217 


6.80 


7.50 


0.89 


0.98 


6.78 


7.37 


6.03 


7.22 


6.55 


8.60 


6.07 


A781 


0.2984 


4.75 


6.00 


0.76 


0.90 


5.72 


5.10 


4.35 


4.59 


5.35 


7.90 


5.08 


A1763 


0.2279 


8.00 


10.50 


1.14 


1.94 


6.55 


6.32 


7.47 


12.26 


7.37 


17.00 


9.04 


A1914 


0.1712 


10.73 


11.30 


1.07 


1.21 


8.26 




8.84 




8.19 


10.60 


7.06 


A2034 


0.113 


6.99 


4.00 


1.13 


0.77 


7.01 




7.92 




7.76 


6.70 


5.23 


A2163 


0.203 


23.86 


28.70 


3.17 


4.40 


13.40 


12.27 


42.48 


53.99 


19.68 


38.50 


32.13 


A2218 


0.171 


5.41 


5.10 


0.73 


0.82 


5.23 




3.82 




5.13 


7.20 


5.44 


A2219 


0.2281 


14.94 


15.50 


1.74 


2.16 


9.37 


10.90 


16.30 


23.54 


11.44 


18.90 


16.53 


A2255 


0.0809 


2.47 


2.90 


0.59 


0.67 


5.79 




3.42 




4.91 


5.90 


3.67 


A2256 


0.0581 


3.92 


4.60 


0.78 


0.82 


6.40 




4.99 




6.11 


7.20 


5.01 


A3266 


0.0602 


4.22 


4.90 


0.96 


1.06 


7.46 




7.16 




7.51 


9.20 


6.36 


A3558 


0.048 


3.54 


3.70 


0.67 


0.73 


4.78 




3.20 




4.77 


6.40 


2.97 


A3921 


0.094 


1.28 


3.10 


0.29 


0.62 


5.03 




1.46 




3.03 


5.40 


2.33 


RX J0043 


0.2924 


8.26 


7.70 


0.88 


0.92 


5.82 


7.59 


5.12 


6.98 


5.88 


8.10 


9.89 


Note. — 


Units are 


10*4 e 


rgs/s for Lx, 10"^* 


Mq for 


mass, keV for temperature, 


and 10^4 


M0keV for Yx and C 


'^D\Yq^. 



(B3) 



Precise definitions of all quantities are specified in the text. The list is divided by the blank row into cool core (upper) and no cool 
core (lower) systems. 

where r = (a;/0.045)^'^. Finally. [Arnaud et al.l (|2010( ) parameterize the pressure profile as 

P(x) oc 7-5 — 

with c = 1.177, 7 = 0.3081, a = 1.0510, and (3 = 5.4905. 

We use each of these profiles to apply aperture corrections for observables when necessary. Specifically, given 
an observable X evaluated at R ~ i?,500j we investigate how a bias b in the mass estimate propagates into cluster 
observables due to the choice of aperture. To do so, we use the above models to estimate the logarithmic slope 
e = d\nX/d\nR, so that X oc i?*^ at i? = i?5oo- Since R500 « M^/^, we see that bias b in the mass will result in a bias 
6*^/^ in the observable of interest. 

We first consider luminosity. Before we evaluate the logarithmic slope for Lx, we first use our model to learn how 
to rescale total luminosities to luminosities within a circular aperture i?5oo ■ As detailed in iPiffaretti et al.l ()2011|) , 
the total soft X-ray band luminosity is relatively insensitive to temperature, so that Lx ex J dVp^^, from which one 
finds Lx{RrMo) / Lxioo) = 0.91, where ix(^50o) is the luminosity within a circular aperture R = R500 (so that the 
integration region is cylindrical). In this calculation, we have truncated the integral at i? = 5i?5oo, a practice we adopt 
throughout. Our results are very weakly dependent on this choice. In practice, however, this correction appears to 
overestimate the luminosity contribution of the cluster outskirts, so we use a value of 0.96 recovered from explicit 
rcanalysis o f clusters by A. Mantz (private communication). 

Using the iPifFaretti et all (j2011h model, we calculate the logarithmic slope e — din L/ din R at R — R500, finding 
e = 0.16. Thus, Lx oc 6" °^ where b is the bias in the mass. This is an exceedingly weak dependence: setting 6 = 1.3, 
we find that the corresponding change in luminosity is just over 1%. Similarly, we have Mgas cx: / dVp^as where the 

R500 of e 



integral is now over a spherical region. Our adopted model results in a logarithmic slope at R 

Mgas OC 6"-4" 



1.20, or 



We next turn to temperature. Following iMazzotta et al.l ()2004l ). we assume that the spectroscopic temperature Tx 
is given by / dVwT/ J dVw where w = pl^^T'^''^ (see also lVikhlini3l2006f ). We find Tx oc fo-o i^. This is a sufficiently 
weak dependence that it can be ignored for our purposes, and reflects the fact that the temperature profile is very flat 
at around R w 0.15i?5oo- Finally, because Yx is dcflned as the product of Mgas and Tx, the sensitivity to bias is just 
the product of the bias scalings for Mgas and Tx individually. Thus, one has Yx oc b^ '^^ x b~^'^^ = b^ "^^. 

Finally, turning towards the integrated pressure oc / dVP, using the lArnaud et al.l (|201Cl[ ) profile, we find Y oc 
at i? = i?5oo- Consequently, a bias b in the mass results in a bias 6°-^'' in the integrated Y. 
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Fig. 6. — Left panel: (Pll-LS)-MlO mass ofTsct, A In M, as a function of rodshift. The yellow band shows the la confidence band of the 
best fit scaling relation, while the red band shows the redshift evolution induced by selection effects in a model where the Ysz/Yx ratio 
scales with Ygz as observed in the right panel. Right panel: The {P11-LS)-M10 offset AlnM, as a function of the SZ signal Ygz- The 
yellow band shows our best fit model. The clusters are split into low and high redshift clusters for illustrative purposes only: the fit uses 
all galaxy clusters (after outlier rejection). The red solid lines shows the scaling induced by selection effects alone pl us intrinsic redshift 
evol ution as observe d in the left panel. In both panels, the solid and dashed blue lines are the predicted scalings for the lPratt et al.l l|2009l ) 
and lLin et al.l II2012I) /gas models. 



C: INTERPRETING THE EVOLUTION IN THE (Pll-LS)-MlO COMPARISON 

There appears to be redshift evolution in the (Pll-LS)-MlO mass ratio. Here, we quantify the significance of this 
evolution, and attempt to determine which of the two data sets might be systematically different at high redshift. We 
do not arrive at any firm conclusions, but do find some hints of possible systematics that require further investigation. 

Significance of the Observed Evolution 
We quantify the evolution in the (Pll-LS)-MlO comparison by fitting the redshift behavior of the mass differences 

to 

AlnM(z) = a + 7ln [ 

As per the discussion in section II. 1[ we use uniform weighting in our fits, and rely on bootstrap resampling for all 
statistical inferences. Cluster A2034 is not included in the fits. We find 7 = —1.90 ± 0.44, which differs from 7 = at 
4.3cr. Thus, the observed evolution is statistically significant. However, part of this evolution is not intrinsic redshift 
evolution, but rather induced evolution: because of volume effects, the (Pll-LS)-MlO cluster sample at high redshift 
is more massive than the low redshift sample. If the systematic mass offset scales with mass, then this selection effect 
would induce apparent redshift evolution. To test for this possibility, we fit a power-law model with lognormal scatter 
to the mass offset A In M as a function of Fsz . We then randomly assign a new A In M to each cluster based on its 
own Ygz value, irrespective of redshift, and measure the resulting redshift evolution. The whole procedure is iterated 
10^ times to evaluate the mean induced evolution and the corresponding uncertainty. 

The left panel in Figure [6] shows the A In A/ vs z for the (Pll-LS)-MlO comparis on. The yellow band is the best 
is the 68% confidence region for a power-law model for the evolution, i.e. Equation ICll The induced evolution due 
to scaling of AlnAf with mass is shown with red error bands. The two are consistent, but there is some suggestion 
that there is intrinsic evolution in AlnM with redshift. We have also carried the reverse calculation: i.e. used our 
best fit redshift evolution to make synthetic realizations of our data so as to determine the induced Ysz evolution. 
The corresponding data, observed evolution (yellow band) , and redshift-induced evolution (red band) are shown in the 
right panel of Figure |6l 

In short, it is apparent from Figure |6] that the mass offset in the (Pll-LS)-MlO comparison depends on cluster 
mass, or cluster redshift, or, more likely, both. We now consider two alternate hypothesis: 1- the masses in Pll-LS are 
correct, and the evolution in AlnM is entirely due to a systematic effect in MIO. 2- the masses in MIO are correct, 
and the cvoultion in A In A/ is entirely due to a systematic effect in Pll-LS. 

Are the \Mantz et al\ 1(20 lOH ) Masses Systematically Evolving? 

The foundation of our investigation is a critical observational fact: all groups are in excellent agreement when it 
comes to estimating the Mgas(-R) profile. Since the masses in MIO are based on Mgas, it follows that any bias must 
ultimately be sourced by the constant /gas = /o model adopted in MIO. Assume then that some other /gas(M, z) is 
correct. Assuming Mgas(i?) gc at R = R500, one has Mgas(i?) = M fgas{M){R/ R^ooV ■ Given /o, MIO assigns a 
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mass by the condition Mgas(i?o) — foM = IoCRq where C = {4tt/3)AcPc- Setting the two expressions for Mgas(i?o) 
equal to each other, we can solve for Rq/R/ in terms of /gas//o- Converting to mass, we arrive at 



where we used 7 = 1.2 as per Appendix [Bl 

If one sets /gas(M, z) to a model consistent with the Pll-LS data, one will recover the observed evolution in the 
(Pll-LS)-MlO; as this is simply a restatement of the fact that the two works recover the sa me M^^^(R) profile . We 
demonstrate this explicitly. The left panel in Figure |6] shows the predicted offset from the iPratt et al.l ()2009[ ) /gas 
model, where the redshift dependence is included assuming an /gas evolution model /gas oc {M / M^.i^z))"' where a is the 
slope of the fgs.s-M relation. In lLin et al.l (|2012[ ). the observed evolution is (1 + 2:)° '^^, which corresponds to a = 0.13. 



The z-dependence for a = 0.21 (IPratt et al.l 12001) is therefore (1 + 2)0-4ixO-2i/o.i3^ ^ M°-^\l + z)°-^^. The 



total evolution predicted by this model is the b lue solid line, in excellent agreement with the observed (Pll-LS)-MlO 
offset. Also shown as a blue dashed line is the iLin et al.l (j2012i ) model, which undcr-predicts the observed evolution 
because of the shallower /gas-M slope. 

The take-home message is that the MIO masses can only be biased if the constant /gas model is incorrect. The 
question then becomes, can we be confident that /gas scales with mass and/or redshift? From a theoretical point of 
view, the expectation from hy drodynamic simulat ions with no feedback or gas cooling is a weakly evolving gas fraction 
with mas s or redshift (see e. g. iStanek et al.ll20l"oh. Some simulations that include baryonic feedback exhibit constant 
/gas fe.g. iFabian et al.[|201lD . but others do not |Nagai et al.ll2007l : IVikhlinin et a l. 2009a). In short, theory does not 
yet give us an unequivocal answer as to whethe r /gas scales with m ass and/or redshift. 

Observationally, things are also not clear cut. I Allen et al.l (|2008D finds that /gas does not scale with cluster temper- 
ature: /gas oc 7"0 02±o.06^ Suggesting no mass-scaling. We note, however, that the fact that these are not core- e xcised 
temperature, cool-core clusters complicates the interpretation of this measurement. Moreover, I Allen et al.l (|2008f ) 
worked at i?2500 rather than i?5oo- If we ignore these difficulties, and assume a self-similar scaling T oc M^/^, we 
find that /gas oc M^-^'^^^-^'^ (though the slope of the Ty^-M relation is itself the source of active investigation, see e.g. 
iMantz fc Allen|[2011[ ). at a pivot point Mp k. 10^^ Mp^. O ther representative values for the slope of the fgas-M relation 
from the literature are a = 0.13 ± 02 dvikhlinin et al.| [2009ah. a = 0.21 ± 0.03 (|Pratt et al.ll2009D . a = 0.09 ± 0.03 
iSun et all (|2009f ). a = 0.13 ± 0.03 (|Lin et al.l l2012[). The range of values in the literature tend to span the range 
a G [0.1, 0.2], which is in mild to modest ten sion with the [Allen e t al. (20QS) value. These differences can be reconciled 
if the fgas~M flattens at high masses, as the lAllen et al.l (|2008D has a higher pivot point that the remaining studies. If 
this is not the case, then there are clearly systematics that remain to be ful ly addressed . Som e examples of possible 
systematics are the use of X-ray temperatures that are not core-excised in en et al.l (|2008D . or parameterization 
svstematics as argued bv lMantz fc AllenI ()2011[ ). 

Things are even l ess clear when it comes to the redshift dependence of /gas. In both lAllen et al.l (|2008| ) and 
IMantz et al.l (|2010bt ). it is assumed that there is no intrinsic redshift evolution in /gas, and any redshift evolution 
is interpreted as evide nce of errors in the underlying cosmo fogical mode l used to interpret the observational data. 
IVikhlinin et al.l ()2009a[) relied on numerical simulations from iNagai et al.) ()2007[ ) to inform their /gas model, which is 
in turn used to estimate M500 from the Mgas data, so again the redshift evolution of /gas was not directly probed. 
()Lin et al.ll20Tl measure /gas oc (1 -f- z)'-'-'^^^'^-^'^, but they had to assume self-similar evolution for the M-Yx relation, 
and even this is only 3cr away from a no-evolution model. Overall, the question of whether or not /gas evolves with 
redshift remains an open question. 



Hav hig considere d the po ssibility that the evolution in the (Pll-LS)-MlO comparison is due to systematic evolution 
in the IMantz et al.l (|2010b( ) masses, we now consider the converse scenario; we assume that the IMantz et al.l (|2010bl ) 
masses are correct, and the observed evolution is entirely due to systematic evolution in the Pll-LS masses, and test 
any observational implications of this scenario. 

Once again, the foundation of our exploration is the observation that the Afgas profile — and therefore the pgas (R) 
profile — is robustly and correctly estimated by all groups. It follows then that any evolution in the Pll-LS masses 
must be due to systematic trends in the X-ray temperature. We can test this hypothesis by constructing an SZ 
temperature Tsz = ^sz/Afgas that can be compared to the X-ray temperature Tx- Any evolution in this ratio — 
which is none other than the Ysz/Yx ratio — would signal a systematic trend in Tx with clust er redshift. Moreover, 
to expl ain the evolution in the (Pll-LS)-MlO comparison, the evolution in the Ysz/Yx ratio of iPlanck Collaboration! 
()2011b[) should evolve from low to high values as one moves higher in redshift. 

The top panel in Figure [7] shows the evolution in the Vs z/yx ratio using the 'Planck CollaborationI (|2011b( ) data. 
For the figure, we have multiplied the Yx value reported in iPlanck Collaboration (.2011b) by a factor of 0.95 so that 
it matches the definition adopted throughout the paper (i.e. Tx is to be measured within [0.15, l]i?5oo, see section 
13.4! for details). To quantify evolution, we fit a power-law to all clusters with z < 0.3, except for A2034, which has 
an incorrect redshift. The redshift cut is also motivated by the fact that all the cross-comparisons we have performed 
have all focused on z < 0.3 clusters. We note, however, that our conclusions are sensitive to this cut, a point that we 
will return to shortly. 
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Fig. 7. — The Ysz/Yy^ ratio as a function of rcdshift (left panel) and Ygz signal (right panel) in the lPlanck Collaboratio"nl Il2011bl) data. 
The best fit power-law model Ysz/^x oc (1 -|- z)^ results in 7 = 1.19 it 0.43, which differs from zero at the 2.7(7 level. The hypothesis 7 = 
can be ruled out at 99.6% confidence. Both of these results use only z < 0.3 clusters (dashed vertical line), as per the rest of the paper. 
The red-dashed horizontal lines denote the 68% confidence interval for the Ygz/Yx ratio for low and high rcdshift clusters, as shown. 



A power-law model Ysz/Yx oc (l + z)'' tit to the data results in 7 = 1.19 ±0.43, which deviates from the no evolution 
expectation at the 2.7a level. The 68% confidence band defined by our best fit model is shown in Figure [71 Note 
that while the Figure extends to 2: > 0.3, the fit only includes z < 0.3 clusters. The extrapolation of the fit to higher 
redshifts is shown by the dotted lines. We have also estimated the probability at which the hypothesis 7 = can be 
ruled out through a direct Monte Carlo experiment. First, we compute the average Ysz/^x ratio over all clusters, and 
estimate the corresponding scatter. We use a log- normal model to assign Ysz/Yx to every cluster independently of its 
rcdshift, and measure the corresponding rcdshift evolution as we have done in the data. The process is repeated 10^ 
times. The confidence at which we can rule out the 7 = hypothesis is the fraction of times for which the slope 7 in 
our synthetic data falls below the observed value. The 7 = hypothesis is rule d out with 99.7% confidence. 

As noted earlier, however, these results are sensitive to whether we cut the iPlanck Collaboration! (|2011b[ ) cluster 
sample at z = 0.3 or not. Removing the z < 0.3 redshift cut adds a few systems with lower Ysz/Yx ratios that drive the 
rcdshift evolution lower. Indeed, visual inspection of Figure [7] gives the distinct impression that there are two distinct 
cluster populations, split at z = 0.13. We have computed the mean Ysz/Yx ratio for z < 0.13 and z g [0.13, 0.3] galaxy 
clusters. For the low and high redshift systems we find ln(ysz/yx) = -0.06 ± 0.04 and \n{Ysz/Y - X) = 0.08 ± 0.04. 
These are shown as the red-dashed lines in the left panel of FigureUl The two values differ at the 2.5a level: interesting, 
but not high enough significance to draw robust conclusions. 

There are two things worth noting about this result. First: if this difference is real, it is not due to selection effects. 
Indeed, Pll-LS have already demonstrated that — because of the tight scatter between Isz and Yx — selection 
effects have a negligible impact on the recovered scaling relation. One might expect that higher S/N clusters — which 
preferentially reside at lower redshift because of the Planck beam — would have a higher Ysz/Yx ratio, which could 
in principle explain the observed evolution. However, this does not appear to be sufficient to explain the apparent 
offset. A linear fit to the Ysz/Yx ratio as a function of S/N result in Ysz/Yx oc (5'/iV)-° °^. The median S/N for 
z < 0.13 and z e [0.13,0.3] is S/N = 11.3 and S/N = 9.2 respectively. The induced shift in the Ysz/Yx ratio is 
A\n{Ysz/Yx)tnduced = 0.091n(11.3/9.2) = 0.02, which is to be compared to the observed shift of A ln(ysz/yx) = 0.14. 
Evidently, this type of selection effect cannot account for the observed difference between the two cluster populations. 

In addition, it is worth emphasizing that the Ysz/Yx ratio is larger than unity for high redshift systems. There 
is now ample empirical evidence that clusters are not isothermal, but rather have falling temperature profiles (e.g. 
iVikhlinin et"all 120061 [Pratt et al.l[20q7t iSun et"alll2009t lArnaud et all 120101 : iMoretti et al.ll2011| ). Because of the 
vs p weighting of the X-ray and SZ signals, the average X-ray temperature is dominated by a region interior to that 
dominating the SZ signal, which, in conjunction with a falling temperature profile, predicts Ysz/Yx < 1- The fact 
that the Pll-LS Ysz/Yx ratio at z G [0.13,0.3] violates this inequality is therefore problematic. 

For completeness, the right panel of Figure [7] shows the dependence of the Ysz/Yx ratio on Fsz, which acts as a mass 
proxy. We find {Ysz/Yx) oc Y^^°-"^^"-°^, which is consistent with zero. Note this implies that any induced evolution 
due to mass selection effects must also be minimal, in agreement with the Pll-LS results on the impact of selection 
effects on the Ysz-Yx relation. 



